Continuous Uncertainty in Trio
نویسندگان
چکیده
We present extensions to Trio for incorporating continuous uncertainty into the system. Data items with uncertain possible values drawn from a continuous domain are represented through a generic set of functions. Our approach enables precise and efficient representation of arbitrary probability distribution functions, along with standard distributions such as Gaussians. We also describe how queries are processed efficiently over this representation, without knowledge of specific distributions. For queries that cannot be answered exactly, we can provide approximate answers using sampling or histogram approximations, offering the user a cost-precision trade-off. Our approach exploits Trio’s lineage and confidence features, with smooth integration into the overall data model and system.
منابع مشابه
Trio-One: Layering Uncertainty and Lineage on a Conventional DBMS∗
Trio is a new kind of database system that supports data, uncertainty, and lineage in a fully integrated manner. The first Trio prototype, dubbed Trio-One, is built on top of a conventional DBMS using data and query translation techniques together with a small number of stored procedures. This paper describes Trio-One’s translation scheme and system architecture, showing how it efficiently and ...
متن کاملTrio-One: Layering Uncertainty and Lineage on a Conventional DBMS (Demo)
Trio is a new kind of database system that supports data, uncertainty, and lineage in a fully integrated manner. The first Trio prototype, dubbed Trio-One, is built on top of a conventional DBMS using data and query translation techniques together with a small number of stored procedures. This paper describes Trio-One’s translation scheme and system architecture, showing how it efficiently and ...
متن کاملTrio-ER: The Trio System as a Workbench for Entity-Resolution
Entity-resolution (also known as deduplication, record linkage, and reference reconciliation, among others) was one of the original motivating applications [6] for the Trio system, which has been under development at Stanford over the past several years. • Entity-resolution is the process of determining when multiple data records are likely to represent the same real-world entity, and possibly ...
متن کاملSIMULATING CONTINUOUS FUZZY SYSTEMS: I
In previous studies we first concentrated on utilizing crisp simulationto produce discrete event fuzzy systems simulations. Then we extendedthis research to the simulation of continuous fuzzy systems models. In this paperwe continue our study of continuous fuzzy systems using crisp continuoussimulation. Consider a crisp continuous system whose evolution depends ondifferential equations. Such a ...
متن کاملAn Introduction to ULDBs and the Trio System
We introduce ULDBs: relational databases that add uncertainty and lineage of the data as first-class concepts. The ULDB model underlies the Trio system under development at Stanford. We describe the ULDB model, then present TriQL, our SQL-based query language for ULDBs. TriQL’s semantics over ULDBs is defined both formally and operationally, and TriQL extends SQL with constructs for querying li...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009